A Combination of Testability and Decodability by Tensor Products 



o 

(N 

>>: 



u 



00 



Michael Videmian* 
Computer Science Department 
Technion — Israel Institute of Technology 
Haifa 32000, Israel 

viderman@cs . technion .ac.il 

January 26, 2013 



0\ ■ Abstract 
(N 

Ben-Sasson and Sudan (RSA 2006) showed that repeated tensor products of Hnear codes with a very 
large distance are locally testable. Due to the requirement of a very large distance the associated tensor 
products could be applied only over sufficiently large fields. Then Meir (SICOMP 2009) used this result 
(as a black box) to present a combinatorial construction of locally testable codes that match best known 



I parameters. As a consequence, this construction was obtained over sufficiently large fields. 

^ In this paper we improve the result of Ben-Sasson and Sudan and show that for any linear codes the 

associated tensor products are locally testable. Consequently, the construction of Meir can be taken over 
any field, including the binary field. 

Moreover, a combination of our result with the result of Spielman (IEEE IT, 1996) implies a con- 



> 

■ struction of linear codes (over any field) that combine the following properties: 



• have constant rate and constant relative distance; 

• have blocklength n and testable with n'^ queries, for any constant e > 0; 



in 

■ • linear time encodable and linear-time decodable from a constant fraction of errors. 

Furthermore, a combination of our result with the result of Guruswami et al. (STOC 2009) implies a 
similar corollary regarding the list-decodable codes. 



u ' 1 Introduction 



Locally testable codes (LTCs) are eiTor correcting codes that have a tester, which is a randomized algorithm 
with oracle access to the received word x. The tester reads a sublinear amount of information from x and 
based on this local view decides if x € C or not. It should accept codewords with probability one, and reject 
words that are far (in Hamming distance) from the code with noticeable probability. 

Such codes are of interest in computer science due to their numerous connections to probabilistically 
checkable proofs (PCPs) and property testing (see the surveys [27, 13] for more information). By now 
several different constructions of LTCs are known including codes based on low-degree polynomials over 
finite fields [21, 1], constructions based on PCPs of proximity/assignment testers [2, 11] and sparse random 
linear codes [8, 18, 19]. In this paper we study a different family of LTC constructions, namely, tensor 
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codes. Given two linear error correcting codes C C , R C F"^ over a finite field F, we define their 
tensor product to be the subspace R ® C CI f"i^"2 consisting of ni x n2 matrices M with entries in F 
having the property that every row of M is a codeword of R and every column of M is a codeword of C. If 
C = Rwe use to denote C » C and for i > 2 define C' = C (^C'-^. Note that the blocklength of C' 
is n\. 

Recently, tensor products were used to construct new families of LTCs [5, 22], new families of list- 
decodable codes [15], to give an alternative proof (see [23]) for IP=PSPACE theorem of [24, 25] etc. 

Ben-Sasson and Sudan [5] suggested to use tensor product codes as a means to constmct LTCs com- 
binatorially. A natural hope would be to expect that given a code C C F^^, whenever the task is to test 
whether a word M G F"i is in C^, the tester for can choose a random row (or column) of M; and if 
M was far from C^, its typical row/column is far from C and hence can be tested on being in C. As was 
shown in [28, 14, 9] this approach fails in general and is known to work only for the base codes having some 
non-trivial properties [12, 6, 7]. 

Nevertheless, Ben-Sasson and Sudan [5] showed that taking the repeated tensor products of any code 
C C F" with sufficiently large distance results in a locally testable code with sublinear query complexity. 

More formally, they showed [5, Theorem 2.6] that for every m > 3 if > | then is locally 

testable using queries. Note that the blocklength of C" is ?i™ and query complexity is n^. Hence, for 
example, if m = 10 we obtain a code with blocklength N = n^^ and query complexity N^'^ = v?, under 

assumption that > |. 

Let us explain some issues that remained open. First of all, it was remained unclear if the assumption 
about a very large distance of the base codes is necessary. Moreover, the requirement on the distance of 
the base code is dependent on the number of tensor products (?n) one should apply. Note that less query 
complexity (relatively to the blocklength) one should get more tensor product operations should be applied. 
Thus the requirement about the distance of the base code is increased when the number of queries one 
should get is decreased. We notice also that the larger distance implies the larger underlying field. It follows 
that this result can not provide (via tensor products) arbitrary low sublinear query complexity {N"^ for every 
constant e > 0) over a fixed field. 

In this paper we ask the following question: is it possible to achieve a similar result to [5] but with no 
requirements about the base codes at all. A positive result to this question might seem surprising since it 
would imply that any linear enor-conecting code can be involved in the construction of LTCs via tensor 
products. 

We give a positive answer on this question and show that no assumptions about the base codes (or 
underlying fields) are not needed. Our result does not make any assumptions about the base codes, and 
in particular we do not assume that the base codes involved in tensor products have very large distance 
and hence it holds over any fields. This contrasts with previous works on the combinatorial constnictions of 
LTCs due to Ben-Sasson and Sudan [5] and Meir [22] which required very large base-code distance implying 
large field size. The constructions of best known LTCs [4, 10, 22] were obtained over the large fields (when 
finally, the field size can be decreased through code concatenation). Our improvement of [5] implies that 
the construction of Meir [22] (which achieves LTCs of best known parameters) can be taken directly over 
any field (including the binary field). We think that this improvement has a non-negligible role since the 
LTCs construction of Meir is combinatorial and the combinatorial constructions of LTCs (or PCPs) should 
be independent, as much as possible, of the algebraic terms such as "polynomials", "field size", "extension 
field" etc. 

Given the fact that error-correcting codes play an important role in a complexity theory, and in particular. 
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in different iterative protocols, it might be helpful to develop a general scheme for constructing the error- 
correcting codes that combine several different properties. E.g., it might be helpful to have a high-rate codes 
which combine such properties as local testing, efficient encoding and decoding from a constant fraction of 
errors. 

We show that a combination of our results with the results of [26, 15] implies the construction of high- 
rate codes which are both testable with sublinear query complexity, linear-time encodable and efficiently 
decodable (or list-decodable) from the constant fraction of errors. 

Organization of the paper. In the following section we provide background regarding tensor codes and 
locally testable codes. In Section 3 we state our main results. We prove our main theorem (Theorem 3.2) in 
Section 4. Finally, in Section 5 we prove our auxiliary statements. 

2 Preliminaries 

Throughout this paper, F is a finite field, [n] denotes the set {1, . . . ,n} and F" denotes F'"!. All codes 
discussed in this paper will be a linear. Let C C F" be a linear code over F. 

For w G F" let supp(w) = {i\wi / 0}, \w\ = \ supp(tt;)| and wt{w) = We define the distance 
between two words x,y £ to be A {x,y) = \{i \ Xi ^ yi}\ and the relative distance to be 6{x,y) = 
^^^'^^ . The distance of a code is defined by A (C) = mmx:^y(zc A (x, y) and its the relative distance is 
denoted S{C) = ■ A [n, k, (i]F-code is a /c-dimensional subspace C C F" of distance d. The rate of the 
code C is defined by rate(C) = ^iHlM. For x G F" and C C F", let 5{x, C) = 5c{x) = min y)} to 

denote the relative distance of x from the code C. We note that A (C) = min {wt(c)}. If 5{x, C) > e 

cec\{o} 

we say that x is e-far from C and otherwise x is e-close to C. We let dim(C) denote the dimension of C. 
The vector inner product between u = {ui,U2, ■ ■ ■ , ?i„) G F" and v = {vi,V2, ■ ■ ■ , Vn) G F" is defined 
to be {u, v) = Yyi<=[n] ■ ""'i- C"^ = {u G F*^ I Vc G C : {u, c) = 0} be the dual code of C and 

= |n G C"*" I |m| = t|. In a similar way we define C<j = {u G C"*- | \u\ < t]. For t G F" and 
T C F" we say that t _L T if (t, t') = for all t' G T. 

For 1(7 G F" and 5" = {ji,j2, • • • ,jm} Q [n], where ji < j2 < ■ ■ ■ < jm, we let w\s = {wj^,. . . , wj^) 
be the restriction of w to the subset S. We let C\s = {c\s \ c G C} denote the restriction of the code C to 
the subset 5. 

2.1 Tensor Product Codes 

The definitions appearing here are standard in the literature on tensor-based LTCs (e. g. [12, 5, 22, 7, 28]). 

For a; G F^ and y G F'^ we let x ® y denote the tensor product of x and y (i. e., the matrix M with 
entries M(j = Xi ■ yj where £ I x J). Let i? C F^ and C C F"' be linear codes. We define the 
tensor product code i? (g) C to be the linear space spanned by words r (g) c G F^^-^ for r G R and c G C. 
Some known facts regarding the tensor products (see e. g., [12]): 

• The code R ^ C consists of all / x J matrices over F whose rows belong to R and whose columns 
belong to C. 

• dim{R ®C) = dim{R) ■ dim(C) 
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• rate(i? (g) C) = rate{R) ■ rate(C) 

• 6{R ® C) = 6{R) ■ 6{C) 

We let C^ = C and C* = C^~^ (g) C for t > 1. Note by this definition, C^" = C and C^* = C^*"' ® C^*"' 
for i > 0. We also notice that for a code C C F" and m > 1 it holds that rate(C™) = (rate(C))"', 
d{C"') = {6(C))"' and the blocklength of C"" is n'". 

The main drawback of the tensor product operation is that this operation strongly decreases the rate and 
the distance of the base codes. We refer the reader to [22] which showed how to use tensor products and 
avoid the decrease in the distance and the strong decrease in the rate. 

2.2 Locally testable codes and Robustly Testable Codes 

A standard q-query tester for a linear code C C F" is a randomized algorithm that on the input word 
€ F" picks non-adaptively a subset / C [n] such that \I\ < q. Then T reads all symbols of w\i and 

accepts if w\i G C\i, and rejects otherwise (see [3, Theorem 2]). Hence a g-query tester can be associated 

with a distribution over subsets / C [n] such that |/| < q. 

For purposes of composition we want to define a generalized tester (Definition 2. 1) which does not make 

queries, but selects and returns a "view" (a subset / C [n]) which can be considered as a code by itself {C\i). 

Definition 2.1 (Tester of C and Test View). A q-query tester D is a distribution D over subsets / C [n] 
such that |/| < q. Let u; € F" (think of the task of testing whether w ^ C) and let / C [n] be a subset. 
We call 'w\i the view of a tester. If G C|/ we say that this view is consistent with C, or when C is clear 
from the context we simply say is consistent. 

When considering a tensor code C F"™, an associated tester will be a distribution over subsets 
/ C [n]™. Although the tester does not output accept or reject, the way a standard tester does, it can be 
converted to output accept, reject as follows. Whenever the task is to test whether w ^ C and a subset 
/ C [n] is selected by the tester, the tester can output accept if w\i G C|/ and otherwise output reject. 

Definition 2.2 (LTCs and strong LTCs). A code C C F" is a {q, e, (5)-LTC if it has a g-queiy tester D such 
that for all w G F", if 6(w, C) > 5 we have Pr \w\i 4. CIA > e. 

A code C C F" is a {q, e)-strong LTC if it has a q-queiy tester D such that for all w G F", we have 
Pr [w\i ^ C\i] >e-6{w,C). 

We notice that a {q, e)-strong LTC is a (g, e6, 5)-LTC for every S > 0. Note that given a code C C F", 
the subset / C uniquely defines C\i. Moreover, the linearity of C implies that Cj / is a linear- subspace of 
F^. In the rest of this section we formally define the notion of robustness (Definition 2.4) as was introduced 
in [5]. To do that we start from the definition of local distance (Definition 2.3), which will be used in 
Definition 2.4 and later in our proofs. 

Definition 2.3 (Local distance). Let C be a code and w\ibe. the view on the coordinate set / obtained from 
the word w. The local distance of w from C with respect to / (also called the /-distance of w from C) is 
A{w\i,C\i) = min {A (wl/, c|/)} and similarly the relative local distance of w from C with respect to / 

(relative /-distance of w from C) is 6{w\j,C\i) = m.m{6{'w\i, c\j)}. 
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Informally, robustness implies that if a word is far from the code then, on average, a test's view is far 
from any consistent view that can be accepted on the same coordinate set /. This notion was defined for 
LTCs following an analogous definition for PCPs [2, 10]. We are ready to provide a general definition of 
robustness. 

Definition 2.4 (Robustness). Given a tester (i. e., a distribution) D for the code C C F", we let 
p^{w)= E [(5(t(;|/, C|/)] be the expected relative local distance of input 

We say that the tester D has robustness p^{C) on the code C if for every it; G F" it holds that p^{w) > 
p^{C)-6c{w). 

Let {Cn}n be a family of codes where C„ is of blocklength n and Dn is a tester for C„. A family of 
codes {C„}„ is robustly testable with respect to testers {Dn}n if there exists a constant a > such that for 
all n we have p^" (C^) > a. 



3 Main Results 

The tester we consider in this paper is the plane tester (suggested in [5]). 

Definition 3.1 (Plane Tester). Let m > 3. Let M € F"'" be an input word and think of testing whether 
M G C"™ . The plane tester V picks (non-adaptively) a random h € [3] and random i G [n] , and returns 
(6, i)-plane (the corresponding local view is Af |(f, i)). Note that if M is a candidate word to be in then 
M|(^ is a candidate word to be in C™^^. 

Now we state our main technical theorem which says that the tensor product of any base code (with 
constant relative distance) is robustly testable. This extends the result of [5] which showed that this claim 
holds for base codes with a very large distance. 

Theorem 3.2 (Main Theorem). Let C QY"^ be a linear code and m > 3. Let V be the plane tester for C"^. 
Then 



2m? 

The proof of Theorem 3.2 is postponed to Section 4. Theorem 3.2 extends the main result of Ben- 
Sasson and Sudan [5] since it implies that the m-wise tensor product of linear codes is robust for any linear 
base codes with constant relative distance. In particular, the tensor product can be applied over any field, 
including the binary field. So, as explained in the introduction, the combinatorial construction of LTCs in 
[22] can be taken over any field (regai^dless of the field size). 

Ben-Sasson and Sudan [5] explained that plane testers can be composed and the robustness of the plane 
testers implies the strong local testability. For the sake of completeness we state this claim fonnally in 
Corollary 3.3, and provide a proof-sketch in Section 5 (see [5, 6] for more information about composition 
of the testers). 

Corollary 3.3. Let C C F" be a linear code and m > 3 is a constant. Then is a {v?, am)-strong LTC, 
where am > is a constant that depends only on m and 5{C). Note that the blocklength ofC"^ is n^. 

Corollary 3.3 implies that any linear code can be used to define a locally testable code with sublinear 
query complexity. Claim 3.4 shows that if a linear code C is linear-time encodable then so is C* for any 
constant i. Later we will use this claim together with Corollary 3.3 to show Corollary 3.5. 
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Claim 3.4. Let m > 1 be a constant. If C CI F" is a linear-time encodable linear code then C"^ is 
linear-time encodable. 

The proof of Claim 3.4 is postponed to Section 5. Now, we combine Corollary 3.3 and Claim 3.4 to 
show a simple construction of strong LTCs with arbitrary small sublinear query complexity and arbitrary 
high rate from any linear code with sufficiently high rate. 

Corollary 3.5. Let C C F" be a linear code and let m > 3 be a constant. Then C"^ C F" 
strong LTC, where am > is a constant that depends only on m and (5(C). 

In particular, for every e > 0, m = \^], N = ri^ and C C F" such that rate(C) > (1 — e)^/™ we 
have C F"^ is a {N'', a)-strong LTC and rate(C"*) > 1 — e, where a > is a constant that depends 
only on e. Moreover, if C is a linear-time encodable then C" is a linear-time encodable. 

Usually, in the areas of locally testable and locally decodable codes the main interest was given to the 
constant query complexity. Recently, Kopparty et al. [20] showed the construction of high-rate locally de- 
codable codes with sublinear query complexity (see [20] for the motivation behind this range of pai^ameters). 
Since then, the interest to the other range of parameters, and in particular, to sublinear query complexity was 
increased. 

We would like to stress that Corollary 3.5 is quite powerful for this range of parameters (sublinear query 
complexity and high rate). First of all, there are different constructions of linear- time encodable codes 
with constant rate and constant relative distance [16, 17, 26], and them all can be involved to define high- 
rate LTCs with sublinear query complexity that are linear-time encodable. The other advantage of such 
constructions is that the repeated tensor product of the base code is known to inherit some properties of 
the base codes besides local testability. E.g., Gopalan et al. [15] showed that the tensor product operation 
preserves list-decodability properties. Furthermore, we know about nice constructions of error-conecting 
codes that can be efficiently encoded and decoded (list-decoded) from a constant fraction of errors (see e.g., 
[16, 17]). 

In Section 3.1 we show how testing with sublinear query complexity can be combined with a linear time 
encoding and decoding. Then, in Section 3.2 we show that Corollary 3.5 can be combined with the result of 
[15] to define asymptotically good codes that can be encodable in linear- time, testable with sublineai^ query 
complexity and list-decodable in polynomial time. 

3.1 Linear- time decodable codes 

Proposition 3.6 shows that the tensor product operation preserves the "unique-neighbor" decoding property. 
In particular, if C is a linear- code that is linear- time unique-neighbor decodable from a constant fraction 
of errors then so is C^. Hence this observation, together with a result of, e.g. [26], can result in the 
construction of asymptotically good locally testable codes with sublinear query complexity that can be 
linear-time encoded and decoded to the closest neighbor after a constant fraction of errors. 

Proposition 3.6. Assume C C F" is a linear code that is linear-time decodable from a ■ n errors. Then 
C"^ = C ® C is a linear code that is linear-time decodable from ^ • "n? errors. 

Proof. Let Decc be a linear-time decoder for the code C that can coiTcct any a ■ n errors. Note that in 
particular, Decc correct any a ■ n erasures in the linear time. We define the linear-time decoder DecQ2 for 
the code that will correct any ^ • errors. 

To do this, let M G F"^*^ be an input word. The decoder Decc2 will decode every row of M using 
Decc and every column of M using Decc- Note that every entry of M is contained in (exactly) one row 
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and one column. Call the entiy of M an inconsistent if row decoding gives to M|(j j) a different value 
from column decoding, and otherwise the entry is called consistent. 

We call the row (column) of M bad if it contains at least an inconsistent entries. Let Badr be a 

2 2 

number of bad rows and Bade be a number of bad columns. It holds that Badr ■ cm < and hence 
Badr < an/100. Similarly, Bade < an/100. 

The decoder Decc2 removes all bad rows and bad columns that have at least an/2 inconsistent entries 
and obtains a large submatrix of size at least (1 — a/100)n x (1 — a/100)n. It is easy to see that all consistent 
entries in the above large submatrix were decoded con^ectly. 

In the last step, the decoder DeC(j2 decodes, using Decc, every row of the large submatrix of M (of 
size at least (1 — a/100)n x (1 — Q!/100)n) from at most an/100 erasures and obtains a submatrix of size 
at least (1 — a/100)n x n. Now, it decodes every column of the submatrix to the full matrix. It can be easily 
verified that the decoder DecQ2 obtains a correct codeword of and runs in linear time. □ 

While the results of [26] were improved, for our purpose (Corollary 3.8) this result is sufficient. 

Theorem 3.7 ([26]). There exists an (explicit) family of linear error correcting codes C C such that 
rate(C) = 0(1), 5{C) = 0(1), C is a linear-time encodable and linear-time decodable from the constant 
fraction of errors. 

A combination of Theorem 3.7, Proposition 3.6 and Claim 3.4 results in the following corollary. 

Corollary 3.8. For every constant e > there exists an ( explicit) family of linear error correcting codes 
C C (obtained by tensor products on the codes from Theorem 3.7) that 

• have constant rate and constant relative distance, 

• linear time encodable and linear time decodable from the constant fraction of errors, 

• are (N'' , a) -strong LTCs, where a = a(e) > {) is a constant. 



3.2 Locally testable and list-decodable codes 

In this section we recall some constructions of the hst-decodable codes. We start by defining the list- 
decodable codes. 

Definition 3.9 (List-decodable codes). A code C is a (a, L)-list decodable if for every word w G F", 
5{w, C) < a we have |{c G C | 5(c, w) < a}| < L. The code is said to be (a, L)-hst decodable in time T 
if there exists algorithm which on the input G F" such that 6{w, C) < a outputs all codewords c G C 
such that 5{c, w) < a (at most L codewords). 

Guruswami et al. [15] showed that the list-decodability is presei-ved in the tensor product operation. 
More fomially, they showed the following theorem stated in [15, Theorem 5.7]. 

Theorem 3.10 ([15]). Let F be a finite field and q = |F|. Given two linear codes Ci, C2 ^ F", for every 
e > 0, the number of codewords 0/C2 (8) Ci within distance Vj* = min(Ji772, (52??i) — 3e of any received 

1 In ^Mnil 



word is bounded by /(C2 Ci, r/*) < ^q^^i' 

Further, if C\ and C2 can be efficiently list decoded up to error rates rji , 7]2 and C2 is a linear code, 
then C2 ^ Ci can be list decoded efficiently up to error rate rj*. Specifically, ifT denotes the time com- 
plexity of list decoding Ci and C2, then the running time of the list decoding algorithm for C2 <S) Ci is 



1 In ^'i'*?!) In ^'2('?2) 



0(4g4ip- . - . .rnin2) 
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Then, Gopalan et al. used Theorem 3.10 to conclude the following theorem, appearing in [15, Theorem 
5.8]. 

Theorem 3.11 ([15]). Let C be a linear code with distance 5, list decodable up to an error rate i]. For every 
5 > 0, the m-wise tensor product code C™ can be list decoded up to an error rate 5'^~^ri — e with a list 
size exp((0(i^^-^^^^ )) '"). Moreover, ifm > 1 is constant and C is polynomial-time list decodable then the 
running time of the list decoding algorithm for is polynomial (depending on m). 

The next fact is known due to the several constructions of list-decodable codes. 

Fact 3.12. There exist linear error-correcting codes of constant rate and constant relative distance that can 
be encoded in linear time and list-decoded in polynomial time. 

We use the combination of Theorem 3.11, Fact 3.12, Claim 3.4 and Corollary 3.5 to conclude the fol- 
lowing corollary. 

Corollary 3.13. Let F be any field. For every constant e > there exists a code C C F^ such that 
C = C"\ where C C F*^ is a linear code, rate(C) = f2(l), 5{C) = r2(l) and C is (p, L)-list decodable 
in polynomial time. 

• C' is a {N^, a)-strong LTC, where a = a(e) > Q is a constant, 

• C is linear time encodable and list-decodable in polynomial time from the constant fraction of errors, 

• rate(C') > Vt{l) and 5{C') = Vt{l). 

4 Proof of Theorem 3.2 

Throughout this paper we assume that C C F" is a linear code. We shall consider an m-wise tensor product, 
i.e., C" C F"'". Note that the blocklength of C" is n*". Throughout this paper we assume that m > 3 
and for the case of m = 2 we refer a reader to [6, 7, 12, 28, 14, 9, 9]). We start this section by defining the 
concepts of points, lines and planes (some of the terms were defined following [5]). 

4.1 Preliminary notations: Points, Lines and Planes 

A point in such a code can be associated with an ?Ti-tuple (ii, Z2, ^m) such that ij S [n]. Next we define 
an axis parallel line, or shortly, a line which can be associated with a subset of points. For h G [m] and 
i G [n] we say that / is a (6, {ii,i2, ■■.,ib-i,h+i, 'im))-line if 

I = {ik,i2, ■■■,ib-i,i,ib+i, ■■■,im) \ for all j G [m] \ {6} we have ij=ij}. 

Note that [b, {ii,i2, ift-i, ib+i, im))-line is parallel to the 6-th axis. A line / contains a point p if p G /. 
Note that a (6, (zi, Z2, ib-i,ib+i, ^m))-line contains a point p = {ji,i2, ...,im) if for all G [m] \ {b} 
we have = jk- Two (different) lines intersects on the point p if both lines contain the point p. 
We say that pi is a (6, i)-plane if 

pi = {{ii,i2, ■■■,im) \ H = i and for all j G [m] \ {b} we have ij G [n]}. 

A (6, z)-plane contains a point p = {ii,i2, ■■■,im) if ib = i, i-C-, b-th coordinate of the point is i. A (6, i)- 
plane contains a line I if it contains all points of the line. We say that two (different) planes are intersected if 
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both planes contain at least one common point. Note that two (different) planes: (61, ii)-plane and (62, ^2)- 
plane are intersected iff 61 7^ b2, moreover, they are intersected on all points p = (ii, . . . ,im) such that 
ii = ifej and ^2 = ^62' i-^-' intersected on n™^^ points. 

Assume that ph is a (61, ii)-plane and pl2 is a (62, i2)-plane such that 61 < 62 (in particular 61 7^ 62)- 
Letp/i npZ2 = {(^1, • • • , im) I ^fei = ^1, ^62 = ^2} be an intersection of two planes and C^^lpi^nph be a code 

restricted to the points in ph n ph- Note that SiC'^lpi.nph) = ^(C""-^) = 5(C)™-2. 

Given a word M € F"™, 5 € [m] and i € [n] we let M(fe j) be a restriction of M to the (6, i)-plane, i.e., 
to all points of the plane. We say that M(b,i) is a {b, i)-plane of M. Similai^ly, for the point p = (ii, . . . , im) 
let Af Ip be a restriction of M to the point p and for the line / we let M|/ be a restriction of M to the line /. 
We say that M|; is a line / of M. 



4.2 The proof itself 

Let M G F*^" be an input word. We prove that p^(M) > ^^^^^5{M, C™). 

For every plane pi of M let r{pl) be the closest codeword of C"^~^ to M\pi (if there are more than one 
such codewords fix anyone arbitrarily). Intuitively, the plane pi of M "thinks" that the symbols of M\pi 
should be changed to r{pl). In this sense every plane of M has its own "opinion". Then we have 

/9^(M) = E [6{M\pi,r{pl))]. (4.1) 

We say that the (61, ii)-plane and the (62, *2)-plane disagree on the point p = {ii, . . . , im) if {bi,ii)- 
plane and (62, i2)-plane are intersected, both contain the point p and r{pli)\p / r{pl2)\p. We say that two 
planes disagree on the line I if both planes are intersected, both contain the line I and r{pli)\i ^ t{j)12)\i. 

Note that if (61, ii)-plane pl\ and (62, ^2)-plane p/2 are intersected and disagree on at least one point 
then letting reg = ph (1 pl2 we have r{pli)\reg / r{pl2)\reg and moreover, S{r{pli)\reg,r{pl2)\reg) > 
((^(C))"*"^. This is true since r{pli)\reg,r{pl2)\reg € C^~^ are non-equal codewords of C""~^ and 
5(C""-2) = (SiC))""-^. 

Let € F2 be a binary matrix such that E\p = 1 if at least two planes disagree on the point p, and 
otherwise E\p = 0. For the point p we say that the point is almost fixed if E\p = but p is contained in 
some plane pi such that r{pl)\p ^ M\p. Intuitively, a point p is almost fixed if all planes containing this 
point agree on this point but "think" that its value in M (A/|p) should be changed (to r{pl)\p). 

We let ToFix = = (zi, 12, . . . , im) \ P is almost fixed} and let NumToFix = \ToFix\. 

Proposition 4.1. It holds that p^{M) > + ^"^^1°^^^ . 

Proof. Equation 4.1 says that p^{M) is a relative distance of a typical plane of M (which is a word in 
F"-'""^) from C"""^ Note that for every point p = (ii, . . . , im)- if E\p ^ then p ^ NeedToFix. That 
means for every point p at most one condition is satisfied: E\p ^ Qor p ^ NeedToFix. 

Note also that for every point p € NeedToFix, for all planes pi of M we have {M\pi)\p 7^ r{pl)\p. 
Now, every point p is contained in m different planes. Hence if E'lp 7^ then for at least one plane pi (of m 
planes containing the point p) it holds that r{pl)\p 7^ M\p. 

Hence a relative distance between a typical plane {pi) of Ad and r{pl) is at least '"^^^^ + M™l^^Ei2L_ □ 

Next we define an important concept of "heavy planes (lines)" in the inconsistency matrix E. Intuitively, 
a heavy plane (line) of the matrix is a plane (line) where many inconsistencies occur, i.e., many non-zero 
symbols. 



9 



Definition 4.2 (Heavy lines and planes). A line I of E is called heavy if \E\i\ > 6{C) • n. A plane (6, i) of 

l(M) 



^ is called heavy if > \ 



Lemma 4.3 is our main observation in the proof of Theorem 3.2. It says that any non-zero element of E 
is located in some heavy plane of E. 

Lemma 4.3 (Main Lemma). Let p = {ii,i2, ■ ■ ■ ,im) be a point such that Ep ^ 0. Then p is contained in 
some heavy plane of E. 

The proof of Lemma 4.3 is postponed to Section 4.2. 1 . Corollary 4.4 shows that it is sufficient to remove 
at most wt(£^) • (5(C))'^~^/2 • ^ planes from E to get a zero submatrix. 

'2\E\ 

Corollary 4.4. There exists Si, ... ^ Sm ^ W such that n—\Si\+n — \S2\ + ■■ -+111— \Sm\ < (g(c)n)'"-i 
and letting S" = x 5*2 x . . . x Sm we have E\s = 0. 

Proof. Let HeavyPlanes = {{b, i) \ {b, i) is a heavy plane} to be a subset of pairs associated with heavy 
planes. For b € [m] let 6*5 = {i € [n] \ {b, i) € HeavyPlanes} and 5;, = [n] \ 6*5. 

2 1 £/ 1 

We claim that \HeavyPlanes\ < (^^{cyn)"^-^ ' "^^^^ since every heavy plane contains at least 

^ ^ ' ^' non-zero elements of E and the total number of non-zero elements of is Furthermore, 

every non-zero element of E is contained in at most m (heavy) planes. Thus n — 15*11 + 71— |52| + ...+ 

n - \Sm\ = EbeH l*^^' - WUWr^ ' 

Now, note that Lemma 4.3 implies that for every point p = (zi, Z2, . . . , im) such that E\p 7^ is 
contained in some heavy plane, i.e., in some plane from HeavyPlanes. Hence if all heavy planes are 
removed from E we obtain a zero submatrix. So, it follows that E\s = Q. □ 

Proposition 4.5 says that if after removing small fraction of planes from M we obtain a submatrix that 
is close to the legal submatrix then M is close to C". 

Proposition 4.5. Let 81,82, 8m Q [n] be such that n— l^il+n— |S'2| + ... + ra — \ 8m\ < rn < 6{C) ■ n 
and let 8 = 81 X 82 X ... X 8m- Let C = C\si (8) C\s2 (8) ... (8) C\s^. Recall that M\s is a submatrix 
of M obtained by removing at most r-fraction of planes. Assume that A {M\s,C') < a ■ n"*. Then 
6{M, C™) <T + a. 

The proof of Proposition 4.5 appears in Section 4.2.2. Let us prove Theorem 3.2. 

Proof of Theorem 3.2. By Proposition 4.1 we have p^{M) > + If wt(E) > ^^^^ then 

(5(C))"^ 

we are done. Otherwise, assume that wt(£') < ^ . 

2 1 1 

Corollary 4.4 implies that it is sufficient to remove at most (j((7)„)m-i • < ^{C) ■ n planes from E to 
get a zero submatrix. Proposition 4.5 implies that 5[M, C"") < • rn + NumToFix _ 

Let p = j^^^. Then p^(M) • /3 > + ^""^^°^" ) • /? > 6{M, and p^{M) > ^^^^^ ■ 

5(M,C7™). □ 

4.2. 1 Proof of Main Lemma 4.3 

In this section we prove Lemma 4.3. 
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Proof od Main Lemma 4.3. By definition of E we know tliat tfiere are (at least) two planes that disagree on 
the point p. Assume without loss of generality (symmetry) that the planes pi = (l,ii) and p2 = (2, ^2) 
disagree on the point p. We will prove that either pi is a heavy plane or p2 is a heavy plane. 

Consider the intersection of pi andp2, i-S-, reg = ph fl pl2 = {(«i, ^2, js, J4, • • • ,jm) \ jk G N}- Note 
that p G reg. Let In be a line, which is parallel to the axis 3 and contains a point p (recall that m > 3). Then 
the planes pi and p2 disagree on this line (since they disagree on the point p contained in the line In), i.e., 
r{pi)\in + r{j)2)\in- But r(pi)|,„,r(p2)hn e C by definition. This implies that A (r(pi)|to, r(p2)bn) > 
5(C) • n, i.e., for at least b{C) ■ n points p G in it holds that r(pi)|p 7^ r(p2)|p- 

Let BadPoints = {p ^ ln\ pi andp2 disagree on p}. Note that \BadPoints\ > S{C) ■ n. Note that 
\BadPoints\ > 6{C)-n. Let BadPlanes = {(3, i) — plane | i G [n],3p G BadPoints s.t. p G (3,i) — plane}. 
Note that \BadPlains\ > 6{C) ■ n. 

We claim that for every plane G BadPlanes we have that either plane disagrees with pi on some point 
p G BadPoints or with p2 on some point p G BadPoints. Hence at least one of pi,P2 disagrees with at 
least i • \BadPlanes\ > ^ • 6{C)n planes from BadPlanes. Without loss of generality assume that pi 
disagrees with at least ^ • 6{C) ■ n planes from BadPlanes. 

Let BadPlaneSp-^ = {plane G BadPlanes \ plane disagrees with pi}. Note that all planes from 
BadPlanes are non-intersecting and thus all planes from BadPlaneSp^ are. non-intersecting. Every plane 
pi G BadPlaneSp^ disagrees with the plane pi on some point and hence disagree on at least (5(C)n)™~^ 
points in their intersection region {pi n pi) since r{pl)\pir\p^ / '"(Pi)lpZnpi £ C™~^. 

Lettotal = {p = {ii,j2, ■ ■ ■ ,jm) I ^plane G BadPlaneSp-^ s.t. p € pi H plane, r{plane)\p r{pi)\p}. 

We have \total\ > {6{C)n)"^~^ ■ = ('^('^)-^) since every intersection region (as above) contains at 

least (5(C)n)"*~^ inconsistency points and there are at least ^ • S{C) ■ n such regions. We stress that we do 
not count any inconsistency point more than once, since the planes in BadPlaneSp^ are non-intersecting. 

(6(C) ^^\m— 1 

Hence the plane pi disagree with other planes in at least 2 points (on the plane). Thus E\p^ 

has at least ^ ^ non-zero symbols. We conclude that pi is a heavy plane of E and the point p is 

contained in the plane pi . □ 



4.2.2 Proof of Proposition 4.5 

In this section we prove Proposition 4.5. 

Proof of Proposition 4.5. Note that for every i G [n] we have \Si\ > n — 5{C) ■ n. The following simple 
claim was proven in [5, Proposition 3.1]. For the sake of completeness we provide its proof. 

Every codeword c' of C" can be extended to a unique codeword c of C^. To see this note that the 
projection of C to C | is bijective. It is surjective because it is a projection, and it is injective because 1 5'i | > 
n — A (C). So, the projection of C to C is bijection, because both codes are of dimension (dim(C))'". 
Thus, every word in C has a unique preimage in C. 

We turn to prove Proposition 4.5. We know that M can be modified in at most a-fraction of points p ^ S 
to get M\s € C. Then, by the claim above, M can be modified (outside the submatrix M\s) to a codeword 
of C" by changing at most r-fraction of symbols (since all symbols outside the submatrix M|s are at most 
T-fraction of all symbols). We conclude that 6{M, C"^) < t + a. □ 



5 Proofs of Auxiliaries Corollaries and Claims 

In this section we first proof Corollary 3.3. 
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Proof Sketch: For i > 3 let Vi be the plane tester for the code C*. Note that the tester returns a local 
view that is a candidate to be in the code C"*"^. Hence Vm~i can be invoked on the local view of Vm, etc. 
So, the testers Vm, Vm~i, ■ ■ ■ can be composed to result in a n^-query tester Vcomp for the code C". 

The robustness of the composed tester will be p^'^'"^p (C"") > p^- (C"") • p^—i (C™"!) ■ . . . ■ p^^ (C^). 
To see this let w € F""" be a word such that S{w, C^) = 6. Then the local view of the tester Vm is 
expected to be p^"" (C'") • S far from C"^"^. When Vm.~i will be invoked, its local view will be /3^'" (C^) • 
p^™-i(C""-i) • 5 far from C'"-^ etc. Finally, the local view of V^ will be (p^'»(C"") • p^--i(C""-i) • 
... -p^HC^)) far from C^. 

Theorem 3.2 says that ^^'"(C"") > Hence for constant m > 3 it holds that p^<:°™p(C™) > 

is a constant that depends only on 6{C) and m. 

Now, let = p^'=°™p(C'™) and note that the query complexity of Vcomp is n^. Let M e F"™ such 
that 6{M, C"') = 6 and think of testing whether M in C"\ We argue that Pri^Vcon.p[M\i G C""|/] > a^- 
This is true since otherwise, Pr/^D^„„p [M|/ ^ |/] < am, and then the robustness (C™) < ■ 1 

(even assuming that whenever M|/ ^ C""|/ we have (^(M|/, C""|/) = 1). Contradiction. 

This proves that C" is a (n^, am)-strong LTC. □ 

Now we prove Claim 3.4. 

Proof of Claim 3.4. Let k = dim(C). Let Ec be an encoder for the code C, which receives a message 
X e F'' and outputs a codeword Ec{x) G C such that C = {Ec{x) | x € F*^}. Assume that Ec has 
running time T = 0{k). Note that this implies that n < T = 0{k) since the blocklength can not exceed 
the mnning time of the encoder. 

For every i > 1 we define E^i to be the encoder for C\ i.e. , C* = | E^i (x) | x G F''"' | . We will argue 

that the running time of E(ji is i ■ • T. Since n < T = 0{k) we will conclude that for any constant 
i>l the running time of Eqi is linear (in A;*). 

We prove the claim by induction on i. The encoder Ec = Eci was defined and its running time is 
T = 1 • n^~^ ■ T. Assume that we defined the encoder Ec^-i for the code C*~^ and its running time is 

(i - 1) • n(*-i)-i • T. 

Let us define the encoder Eci for the code C*. Note that the code C* has message length /c* and its 
blocklength is n*. Hence the message x G F'^' can be viewed as a matrix k x and the encoder Ec^ will 
first encode (by the encoder Eci-i) every row of the matrix, obtaining the matrix k x re*~^. Then Eci will 
encode every column of the obtained matrix to get a codeword of C\ The running time of the encoder C* is 
k-{{i-l)- n'-'^T) + n'-^T < {{i - 1) • n^^^T) + n^'^T = i ■ n^~^ ■ T. □ 
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